Federated learning based on stratified sampling and regularization

نویسندگان

چکیده

Abstract Federated learning (FL) is a new distributed framework that different from traditional machine learning: (1) differences in communication, computing, and storage performance among devices (device heterogeneity), (2) data distribution volume (data (3) high communication consumption. Under heterogeneous conditions, the of clients varies greatly, which leads to problem convergence speed training model decreases cannot converge global optimal solution. In this work, an FL algorithm based on stratified sampling regularization (FedSSAR) proposed. FedSSAR, density-based clustering method used divide overall client into clusters, then, some available are proportionally extracted clusters participate realizes unbiased for reduces aggregation weight variance client. At same time, when calculating local loss function, we limit update direction by regular term, so optimized globally direction. We prove FedSSAR theoretically experimentally, demonstrate superiority comparing it with other algorithms public datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stratified Sampling Meets Machine Learning

This paper solves a specialized regression problem to obtain sampling probabilities for records in databases. The goal is to sample a small set of records over which evaluating aggregate queries can be done both efficiently and accurately. We provide a principled and provable solution for this problem; it is parameterless and requires no data insights. Unlike standard regression problems, the l...

متن کامل

Stratified Sampling Design Based on Data Mining

OBJECTIVES To explore classification rules based on data mining methodologies which are to be used in defining strata in stratified sampling of healthcare providers with improved sampling efficiency. METHODS We performed k-means clustering to group providers with similar characteristics, then, constructed decision trees on cluster labels to generate stratification rules. We assessed the varia...

متن کامل

Support Vector Machine based on Stratified Sampling

Support vector machine is a classification algorithm based on statistical learning theory. It has shown many results with good performances in the data mining fields. But there are some problems in the algorithm. One of the problems is its heavy computing cost. So we have been difficult to use the support vector machine in the dynamic and online systems. To overcome this problem we propose to u...

متن کامل

On stratified randomized response sampling

In this paper, we propose a new quantitative randomized response model based on Mangat and Singh [7] two-stage randomized response model. We derive the estimator of the sensitive variable mean, and show that our method is more efficient than other randomized response models suggested by Greenberg et al. [3] and Gupta et al. [4] estimators.

متن کامل

Stratified Median Ranked Set Sampling: Optimum and Proportional Allocations

In this paper, for the Stratified Median Ranked Set Sampling (SMRSS), proposed by Ibrahim et al. (2010), we examine the proportional and optimum sample allocations that are two well-known methods for sample allocation in stratified sampling. We show that the variances of the mean estimators of a symmetric population in SMRSS using optimum and proportional allocations to strata are smaller than ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Complex & Intelligent Systems

سال: 2022

ISSN: ['2198-6053', '2199-4536']

DOI: https://doi.org/10.1007/s40747-022-00895-3